Utilizing inter-passage and inter-document similarities for reranking search results
Similar resources
Investigating Usage of Text Segmentation and Inter-passage Similarities to Improve Text Document Clustering
Measuring inter-document similarity is one of the most essential steps in text document clustering. Traditional methods rely on representing text documents using the simple Bag-of-Words (BOW) model. A document is an organized structure consisting of various text segments or passages. Such single-term analysis of the text treats the whole document as a single semantic unit and thus ignores other se...
Inter-document Similarities, Language Models, and Ad Hoc Information Retrieval
Search engines have become a crucial tool for finding information in repositories containing large amounts of textual data in unstructured form (e.g., the Web). However, the task of ad hoc information retrieval, that is, finding documents within a corpus that are relevant to an information need specified using a query, remains a hard challenge. The language modeling approach to information retr...
Enhancement of Search Results Using Dynamic Document Seed Reranking Algorithm
We proposed an algorithm to improve the precision of top retrieved documents by reordering the documents returned by the initial retrieval. To reorder the documents, we first automatically extract key terms and key phrases from the top N retrieved documents and generate a document index for each document. Using the standard similarity metrics, a document similarity matrix is generated for these d...
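The abstract above is truncated, so what the authors do with the similarity matrix is not known here. As a minimal sketch of the general idea only, the Python below builds a cosine-similarity matrix from per-document key-term lists and then reorders the documents. The function names, the choice of cosine similarity, and the centrality-based scoring rule are all assumptions made for illustration and are not taken from the paper.

```python
import math
from collections import Counter


def term_vector(key_terms):
    """Build a sparse term-frequency vector from a document's key terms."""
    return Counter(key_terms)


def cosine(u, v):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(weight * v[term] for term, weight in u.items() if term in v)
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def similarity_matrix(vectors):
    """Pairwise cosine similarities for the top-N retrieved documents."""
    n = len(vectors)
    return [[cosine(vectors[i], vectors[j]) for j in range(n)] for i in range(n)]


def rerank_by_centrality(doc_ids, matrix):
    """Reorder documents by their mean similarity to the other documents.

    ASSUMPTION: this scoring rule is hypothetical and chosen only to show
    one way a similarity matrix could drive reranking.
    """
    n = len(doc_ids)
    scores = []
    for i in range(n):
        others = [matrix[i][j] for j in range(n) if j != i]
        scores.append(sum(others) / len(others) if others else 0.0)
    # sorted() is stable, so tied documents keep their initial retrieval order.
    order = sorted(range(n), key=lambda i: scores[i], reverse=True)
    return [doc_ids[i] for i in order]


if __name__ == "__main__":
    # Hypothetical key terms extracted from three retrieved documents.
    key_terms = {
        "d1": ["query", "ranking", "retrieval"],
        "d2": ["ranking", "retrieval", "passage"],
        "d3": ["cooking", "recipe"],
    }
    ids = list(key_terms)
    matrix = similarity_matrix([term_vector(key_terms[d]) for d in ids])
    print(rerank_by_centrality(ids, matrix))
```

Scoring by mean similarity favors documents that resemble many other top-ranked documents, which is one common intuition behind similarity-based reranking; other uses of the matrix, such as clustering or graph-based propagation, would fit the same inputs.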
Inter-organizational Document Exchange
Information exchange processes are often characterized by the need to translate from one data format into another in order to achieve compatibility between information systems. A conversion problem often arises when exchanging files between applications of different software vendors or when incorporating legacy business data into new standard software. In this paper we want to survey the conv...
Inter-document Contextual Language model
In this paper, we examine the impact of employing contextual, structural information from a tree-structured document set to derive a language model. Our results show that this information significantly improves the accuracy of the resultant model.
Journal
Journal title: ACM Transactions on Information Systems
Year: 2010
ISSN: 1046-8188,1558-2868
DOI: 10.1145/1877766.1877769